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Background and purpose — There is no consensus regarding 
the clinical relevance of gender-specific prostheses in total knee 
arthroplasty (TKA). We summarize the current best evidence 
in a comparison of clinical and radiographic outcomes between 
gender-specific prostheses and standard unisex prostheses in 
female patients. 

Methods — We used the PubMed, Embase, Cochrane, Science 
Citation Index, and Scopus databases. We included randomized 
controlled trials published up to January 2013 that compared 
gender-specific prostheses with standard unisex prostheses in 
female patients who underwent primary TKAs. 

Results — 6 trials involving 423 patients with 846 knee joints 
met the inclusion criteria. No statistically significant differences 
were observed between the 2 designs regarding pain, range of 
motion (ROM), knee scores, satisfaction, preference, compli- 
cations, and radiographic results. The gender-specific design 
(Gender Solutions; Zimmer Inc, Warsaw, Indiana) reduced the 
prevalence of overhang. However, it had less overall coverage of 
the femoral condyles compared to the unisex group. In fact, the 
femoral prosthesis in the standard unisex group matched better 
than that in the gender-specific group. 

Interpretation — Gender-specific prostheses do not appear to 
confer any benefit in terms of clinician- and patient-reported out- 
comes for the female knee. 



Women account for almost two-thirds of knee arthroplasties 
(Kurtz et al. 2007). Recently, a possible effect of gender on 
functional outcomes and implant survivorship has been identi- 
fied (Vincent et al. 2006, Ritter et al. 2008, Kamath et al. 2010, 
Parsley et al. 2010, O'Connor 2011). Gender differences in 
the anatomy of the distal femur are well documented (Conley 
et al. 2007, Yue et al. 2011a,b, Yan et al. 2012, Zeng et al. 



2012). Women tend to have a less prominent anterior condyle 
(Conley et al. 2007, Fehring et al. 2009), a higher quadriceps 
angle (Q-angle) (Hsu et al. 1990, Woodland et al. 1992), and 
a reduced mediolateral to anteroposterior aspect ratio (Chin et 
al. 2002, Chaichankul et al. 2011). Investigators have found 
that standard unisex knee prostheses may not equally match 
the native anatomy in male and female knees (Clarke and 
Hentz 2008, Yan et al. 2012). A positive association between 
the femoral component size and the amount of overhang was 
observed in females, and femoral component overhang (a 3 
mm) may result in postoperative knee pain or reduced ROM 
(Hitt et al. 2003, Lo et al. 2003, Mahoney et al. 2010). 

The concept of gender-specific knee prostheses was intro- 
duced to match these 3 anatomic differences in the female 
population (Conley et al. 2007). It includes a narrower medio- 
lateral diameter for a given anteroposterior dimension, to 
match the female knee more closely. Additionally, the ante- 
rior flange of the prothesis was modified to include a recessed 
patellar sulcus and reduced anterior condylar height (to ovoid 
"overstuffing" during knee flexion) and a lateralized patel- 
lar sulcus (to accommodate the increased Q-angle associated 
with a wider pelvis). 

Several randomized controlled trials (RCTs) have failed 
to establish the superiority of the gender-specific prosthesis 
over the unisex knee prosthesis in the female knee (Kim et al. 
2010a,b, Song et al. 2012a, Thomsen et al. 2012, von Roth et 
al. 2013). In contrast, other studies have found higher patient 
satisfaction and better radiographic fit in the gender-specific 
TKAs than in the standard unisex TKAs (Clarke and Hentz 
2008, Parratte et al. 201 1, Yue et al. 2014). We therefore per- 
formed a systematic review and meta-analysis to compare the 
clinical and radiographic results of TKA in female patients 
receiving gender-specific prostheses or standard unisex pros- 
theses. 
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Material and methods 

Our systematic review and meta-analysis was conducted 
according to the PRISMA guidelines. 

Search strategy and eligibility criteria 

With the help of a librarian, we performed an electronic search 
of the current literature up to January 2013 using the PubMed, 
Embase, Cochrane, Science Citation Index, and Scopus data- 
bases. We used the following key words: knee arthroplasty, 
knee replacement, knee arthroplasties, knee replacements, 
TKA, TKR, gender, sex, female, gender-specifc, sex-specific, 
and female-specific. Additional strategies to identify relevant 
studies included: (1) a manual search of the table of contents 
of 5 major orthopedic journals (Journal of Bone and Joint 
Surgery (American and British); Clinical Orthopaedics and 
Related Research; Journal of Arthroplasty; Knee Surgery, 
Sports Traumatology, Arthroscopy; and The Knee) from 2005 
through December 2012; (2) a literature search of the bibliog- 
raphies of all the retrieved reports that evaluated clinical out- 
comes following the use of gender-specific knee prostheses; 
(3) a review of the titles of meeting abstracts from Orthopae- 
dic Proceedings. No restrictions were placed on the origin or 
language of the publications. 

First, all the articles comparing the outcomes of gender- 
specific and unisex prostheses were retrieved. In order to be 
included in the systematic review and meta-analysis, the stud- 
ies had to meet the following criteria: (1) they had to have been 
described as a "randomized controlled trial (RCT)" related to 
primary TKA; (2) the subjects of the study had to have been 
women with severe knee arthrosis; (3) female-specific knee 
prostheses had to have been selected; and (4) follow-up had 
to have been more than 1 year. We excluded case reports, 
case series, editorials, commentaries, letters to the editor, and 
reviews, but these were read to identify any potential articles. 

Study identification 

2 reviewers (TC and CZ) independently screened titles, 
abstracts, and full-text papers for possible relevance. The 
reviewers were not blinded as to authors, geographic location, 
and hospital affiliation. Disagreements between the reviewers 
were resolved by consensus. If a disagreement remained, a 
third reviewer resolved the disagreement by consensus after 
discussing the inclusion and exclusion criteria with the other 
reviewers. 

Data extraction and assessment of methodological 
quality 

For each eligible study, one reviewer (TC) extracted relevant 
data including country of origin, year of publication, financial 
support, demographic data (age and body mass index (BMI)), 
postoperative pain, range of motion (ROM), knee scores 
(e.g. Knee Society score (KSS), Hospital for Special Surgery 
score (HSS), or Western Ontario and McMaster Universities 



Potentially relevant studies 
identified and screened 
for retrieval 
(n = 314) 






Excluded for ineligibility of 
publication type, study design, 
participants, or comparisons 
by browsing titles or abstracts 
(n = 304) 








Studies retrieved 
for more detailed 
full text evaluation 
(n = 10) 








Excluded (n = 4): 

- results were from female and 
male patients, 2 

- participants were not randomly 
allocated, 1 

- patients were followed up for 
less than 12 months, 1 








Studies included 
with eligible data 
(n = 6) 







Figure 1. Flow diagram of selection process. 



osteoarthritis score (WOMAC)), postoperative complications, 
patient satisfaction and preference, femoral component fit, and 
radiographic results. These data were then cross-referenced by 
the second reviewer. If necessary, the original authors were 
contacted for more information and clarification of data. 

2 of the present authors (JW and MC) independently 
assessed the methodological quality of each included study 
regarding randomization method, allocation concealment, 
blinding of patients/assessors, equality of baseline charac- 
teristics, adequate description of inclusion/exclusion criteria, 
similarity of postoperative program, sample size calculation, 
and the proportion of patients lost during follow-up. The 3 
options for the results of quality assessment are as follows: 
"Yes" indicates a low risk of bias, "Unclear" indicates a mod- 
erate risk of bias, and "No" indicates a high risk of bias. 

Statistics 

Wherever appropriate, the data from the studies were pooled 
together using Comprehensive Meta-analysis (Biostat Inc., 
Englewood, New Jersey). Statistical heterogeneity was 
assessed using I-square statistics. We chose to compare cate- 
gorical variable data using risk ratio (RR) and continuous vari- 
able data using weighed mean difference (WMD) as summary 
statistic with corresponding 95% confidence intervals (CIs). 
The results were pooled using fixed-effects model (Mantel- 
Haenszel test); but when there was significant heterogeneity 
in results between studies (I 2 > 50%), a random-effects model 
was used (DerSimonian-Laird method). 



Results 

The literature search identified 314 studies, of which 308 were 
eliminated by screening the title, reading the abstract, or read- 
ing the entire article (Figure 1). 2 comparative studies were 
excluded from this systematic review because both female and 
male patients were recruited for evaluation (Clarke and Hentz 
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A Author, year, country 
B No of patients 
C No of knees 

D No of gender-specific (GS) total knee arthroplasties 
E No of control total knee arthroplasties 
F Mean age (SD) or (range) 

a GS 

b Control 

G Mean body mass index (SD) or (range) 
a GS 
b Control 
H Type of prosthesis 
a GS 
b Control 

CR: posterior cruciate-retaining 
PS: posterior stabilized 
LPS: NexGen Legacy Posterior Stabilized 
I Outcome and measures 
KSS: Knee Society Score 
ROM: range of motion 

WOMAC: Western Ontario and McMaster Universities osteoarthritis index 
J Follow-up, years 



2008, Tanavalee et al. 2011). Another comparative study was 
also excluded due to non-randomized treatment allocation 
(Lionberger et al. 2012). A prospective study was excluded 
because it reported intraoperative ROM, not postoperative out- 
comes (Song et al. 2012b). Finally, 6 RCTs met our eligibility 
criteria (Kim et al. 2010a,b , Kumar et al. 2012, Singh et al. 
2012, Song et al. 2012a, Thomsen et al. 2012). They included 
846 knees, in which 423 female patients had simultaneous bilat- 
eral TKA with a gender-specific prosthesis in one knee and a 
standard unisex prosthesis in the contralateral knee. The sample 
size of the data ranged from 48 to 276 cases with an average 



age of 64-7 1 years and a mean follow-up time of 2. 1-3 .2 years. 
Of the 6 studies, 3 specifically stated that there had been no 
financial support from a sponsoring medical company (Kim et 
al. 2010a,b, Thomsen et al. 2012) and the other 3 did mention 
whether support had been received (Kumar et al. 2012, Singh et 
al. 2012, Song et al. 2012a). Of the 6 studies, 3 were conducted 
in Korea (Kim et al. 2010a,b, Song et al. 2012a), 2 in India 
(Kumar et al. 2012, Singh et al. 2012), and one in Denmark 
(Thomsen et al. 2012). 5 papers did not describe the randomiza- 
tion procedures in an adequate way; only 1 study was random- 
ized based on a computer-generated list (Tables 1 and 2). 
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Table 2. Methodological quality of the studies included 
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Range of motion 

5 articles reported the active ROM of the knee in the supine 
position (Kim et al. 2010a,b, Singh et al. 2012, Song et al. 
2012a, Thomsen et al. 2012), and only 3 studies provided the 
active ROM outcome in the weight-wearing position (Kim 
et al. 2010a,b, Song et al. 2012a). Data pooled with a fixed- 
effects model showed no statistically significant differences 
in the active ROM under weight-bearing conditions between 
the gender-specific and unisex groups (WMD = 0.66, CI: -2.0 
to 3.4; p = 0.6). Under non-weight-bearing conditions, the 
gender-specific knee prostheses did not offer a statistically 
significant advantage in increasing ROM (WMD = 0.57, CI: 
-2.4 to 3.5; p = 0.7). 

Clinical knee scores 

The KSS was used in 4 RCTs (Kim et al.2010a,b, Kumar et 
al. 2012, Singh et al. 2012), the HSS was used in 4 (Kim et 
al. 2010a,b, Kumar et al. 2012, Singh et al. 2012), and the 
WOMAC was used in 3 (Kim et al. 2010a,b, Song et al. 
2012a). The KSS was similar between the 2 groups (WMD = 
-0.45, CI: -1.5 to 0.55; p = 0.4). For the WOMAC score at the 
final follow-up, the WMD was -0.94 (CI: -3.1 to 1.2; p = 0.4), 
with little between-study heterogeneity (p = 0.8, 12 = 12%). 
For the studies reporting the HSS at the final follow-up, the 
WMD was 0.1 1 (CI: -1.3 to 1.5; p = 0.9), with little between- 
study heterogeneity (p = 0.8; 12 = 0%). 

Participant preference and satisfaction 

Of the 323 patients who had bilateral TKAs with different 
types of prosthesis (gender-specific or unisex), 273 patients 
(3 trials) reported on preference (Kim et al. 2010a,b, Song et 
al. 2012a). 216 patients (79%) had no preference, 27 (10%) 
preferred the standard prosthesis and 30 (11%) the gender- 
specific prosthesis. Although more patients preferred the gen- 



der-specific prostheses to the unisex prostheses, meta-analysis 
of these trials did not show any statistically significant differ- 
ence between the numbers of the patients who preferred one 
or other type of prosthesis (RR = 1.1, CI: 0.68-1.8; p = 0.7). 

Of the 6 bilateral trials, participant satisfaction using a 
visual analog scale (VAS) questionnaire in bilateral cases was 
documented in 3 (Kim et al. 2010a,b, Thomsen et al. 2012), 
involving 247 patients. The mean satisfaction score was simi- 
lar in both groups (WMD = 0.04, CI: -0.52 to 0.60; p = 0.9). 
In 1 trial (Kumar et al. 2012), 30 patients reported their overall 
satisfaction after surgery using the British Orthopaedic Asso- 
ciation (BOA) patient satisfaction score. The authors found 
that there were no statistically significant differences in BOA 
score between the 2 groups at either of 2 follow-up periods. 

Pain 

Of the 6 studies, 5 provided data on postoperative pain. 1 1 % 
(49/446) of the gender- specific knees gave pain, the same 
proportion (48/446) as in the unisex knees. A visual analog 
scale (VAS) was used to assess postoperative pain in the study 
by Thomsen et al. (2012), who found no statistically signifi- 
cant differences between the 2 prosthetic designs at 3 and 12 
months postoperatively. 3 of the studies included adopted the 
KSS and HSS subscales for knee pain (Kim et al. 2010a,b, 
Singh et al. 2012). At the time of the latest follow-up, the pain 
scores were similar in both the standard and gender- specific 
groups. 

Fit of the femoral component 

Meta-analysis of 4 trials revealed that the gender- specific 
femoral component did not fit better than the standard femoral 
component (Kim et al. 2010a,b, Kumar et al. 2012, Thomsen 
et al. 2012) (RR = 0.45, CI: 0.36-0.56; p < 0.001). Specifically, 
in the female patients with the unisex prosthesis, the femoral 
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Figure 2. Comparison of the effects of gender-specific prosthesis and standard design on complications after surgery. 
(■) The weighting given to the trial in the overall pooled estimate, taking into account the number of participants and the 
amount of between-study variation (heterogeneity). (♦) The combined effect size. 



prosthesis was closely matched in 172 knees (56%), overhung 
in 41 (13%), and undercovered the bone in 94 (31%). In the 
female patients with the gender-specific prosthesis, 77 knees 
(25%) had a close fit and 230 knees (75%) had undercoverage. 

Complications 

Postoperative complications and adverse events were reported 
in 4 trials (Kim et al. 2010a,b, Singh et al. 2012, Thomsen 
et al. 2012), but these studies did not document any adverse 
events related to the knee prostheses. The complication rates 
were low, and they were comparable in both groups (RR = 1.0, 
CI: 0.42-2.3; p = 1.0, 12 = 0%) (Figure 2). 

Radiographic findings 

Of the 6 articles, 5 mentioned radiographic results (Kim et al. 
2010a b, Kumar et al. 2012, Song et al. 2012a, Thomsen et 
al. 2012). There were no statistically significant differences 
between the groups regarding radiographic parameters includ- 
ing alignment of the limb (femoro-tibial angle), component 
positioning, cover of the tibial surface, the level of the joint 
line, anterior and posterior condylar offset, and patellar tilt 
angle. None of the knees in either group had a complete radio- 
lucent line wider than 1 mm around any component (Kim et al. 
2010a,b, Kumar et al. 2012). 



Discussion 

We found that the gender-specific design did not provide any 
clinical advantages over the standard unisex design in the 
female knee with respect to knee scores, patient preferences, 
patient satisfaction, and radiographic outcomes. 

The strengths of meta-analysis study lie in the large sample 
size and the inclusion of only RCTs (level I or II evidence). Of 
the 6 eligible trials, 5 performed simultaneous bilateral TKAs, 
which allowed comparisons of clinical and radiographic out- 
comes, satisfaction, and preference in the same patients and 
detection of subtle differences in prosthesis performance. This 
method has the advantage that it minimizes potential bias 



introduced by preoperative patient-related factors such as age, 
body weight, and pain threshold. 

Our study had several potential weaknesses. Firstly, the 
follow-up period was short. Secondly, our data pertain only 
to a single implant system (Zimmer Gender Solutions Knee). 
Additional studies of the femoral prosthesis in other knee 
systems are needed before our conclusions can be applied 
to other designs. Thirdly, there is considerable anatomical 
variation in the knee among different ethnic groups. In our 
study, 190 of 423 patients (45%) were from the Asian popu- 
lation, whose knees have generally smaller anthropometric 
measurements than those of Caucasians (Vaidya et al. 2000, 
Urabe et al. 2008, Chaichankul et al. 201 1, Yue et al. 201 la). 
They require a smaller range of component sizes in TKAs. 
Thus, the ethnic differences in anthropometric measure- 
ments may limit the generalizability of our results to western 
populations. 

The need for gender-specific knee arthroplasties is based 
on the following assumptions: (1) women have results that 
are inferior to those of men after TKA, and (2) traditional 
prosthesis designs have failed to address these differences. A 
number of clinical studies have refuted the idea that women 
have worse outcomes than men using traditional total knee 
designs (MacDonald et al. 2008, Merchant et al. 2008, Ritter 
et al. 2008, Dalury et al. 2009, Johnson et al. 2011). In fact, 
some studies have found that women achieve essentially the 
same results as men, or even better (Merchant et al. 2008, 
Parsley et al. 2010, O'Connor 2011). A systematic review by 
Merchant et al. (2008) found no evidence for anatomical dif- 
ferences between men's and women's knees that would jus- 
tify a female-specific design. The average anatomical differ- 
ences between male and female knees can be explained by 
the smaller height and size of women on average, not by their 
gender (Merchant et al. 2008). A recent study showed that the 
shape of the knee is not only dependent on gender, but also on 
the morphotype of the patient (Bellemans et al. 2010). Pros- 
thesis design should take inter-individual variations in knee 
joint anatomy into account (Dargel et al. 201 1, Gillespie et al. 
201 1). However, these anatomical differences may be so small 
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that have no clinical effect, which would agree with the find- 
ings of the present study. 

Tanavalee et al. (2011) intraoperatively evaluated the over- 
hang of the cutting block on the posteromedial edge of the 
femoral condyle. They found that the overall percentage of 
gender-specific prosthesis selection was higher in female 
patients than in male patients. This phenomenon frequently 
occurs in medium-sized to large-sized female patients. The 
medial or lateral overhang of unisex prostheses was worse in 
female knees than in male knees (Clarke and Hentz 2008, Guy 
et al. 2012). Yan et al. (2012) reported a higher prevalence 
of prosthetic overhang in women with standard unisex knee 
prostheses. Consistent with these findings, our meta-analysis 
showed that gender-specific design reduced medial or lateral 
overhang in female patients whereas there was a higher preva- 
lence of underhang than with unisex prostheses. Although a 
previous study confirmed that femoral component overhang 
may create postoperative pain due to soft tissue irritation and 
soft tissue imbalance (Mahoney and Kinsey 2010), our results 
suggest that the difference in condylar cover does not affect 
pain scores and knee function. Based on these contradictory 
results, the question remains whether the modifications made 
to the gender-specific design are extensive enough to closely 
match femoral anatomy in female patients and offer any clini- 
cal advantages. Notably, the higher incidence of underhang 
in the gender-specific group, which exposed more cancellous 
bone than with the NexGen standard prostheses, could be a 
source of higher perioperative blood loss and may induce 
increased ostolysis from wear debris with longer follow-up 
(Hitt et al. 2003, Kim et al. 2010 a, b). 

Overstuffing of the patellofemoral compartment may be 
associated with pain and reduced ROM. The use of a stan- 
dard TKA could possibly lead to overstuffing because of a less 
prominent anterior condyle in the female knee. Despite the 
fact that reduced height of the anterior flange and the deeper 
trochlear groove would help to prevent overstuffing of the 
patellofemoral joint and improve patellar tracking in the gen- 
der-specific design, we found similar postoperative pain and 
ROM when comparing the 2 knee prostheses. 

Improvement in clinical outcomes following knee arthro- 
plasty is usually measured by score scales, which are asso- 
ciated with a ceiling effect and poor ability to discriminate 
between high-functioning arthroplasty patients (Senden et 
al. 2011, Na et al. 2012). More sensitive outcome scores are 
needed. Gait analysis could provide more objective param- 
eters, such as measurements of sagittal plane kinematics and 
kinetics (Borjesson et al. 2005, Catani et al. 2012). 

A recent gait analysis study suggested that there was no 
difference between the 2 knee prosthesis designs in terms of 
kinematic, kinetic, and temporo-spatial parameters (Thomsen 
et al. 2012). Given the inadequate correlations between knee 
scores and gait parameters, further gait analysis studies moni- 
toring the functional benefit of design in TKAs are needed 
(Liebensteiner et al. 2008). 



In conclusion, gender-specific changes to knee prosthe- 
ses do not translate into improved outcomes. Furthermore, 
gender-specific protheses would not necessarily fit any better 
than unisex designs. 
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